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NOVEL NAGPU 

FIELD OF THE INVENTION 

This invention relates to newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the use of such polynucleotides and polypeptides, as well as the 
5 production of such polynucleotides and polypeptides and recombinant host cells 

transformed with the polynucleotides. The invention also relates to inhibiting the action of 
such polypeptides and to the use of inhibitors in therapy. 
BACKGROUND OF THE INVENTION 

N-Acetylglucosamine 1 -Phosphate UridyHransferase (NAGPU) catalyzes the 

1 0 formation of UDP-N-Acctyl Glucosamine, an essential precursor for cell wall 

peptidoglycan in all bacteria and of lipopolysaccharide and enterobacterial common antigen 
in gram-negatives. The enzyme has been purified from Escherichia coli and is Afunctional, 
also catalyzing the preceding step of N-acetylation of glucosamine- 1 -phosphate (Mengin- 
Lecreulx, D. and van Heijenoort, J, J. Bacterid. 176: 5788-5795 [1994]). it is possible to 

1 5 block the acety transferase activity but not the uridyl transferase activity with thiol 

inhibitors, suggesting that the enzyme may have two domains. The gene glmU encoding 
the enzyme has been cloned from £. coli (Mengin-Lecreulx, D. and van Heijenoort, J, 
J.Bacteriol. 175: 6150-6157 [1993]) and its counterpart in Bacillus subtilis (gcaD) has also 
been identified (Hove- Jensen B, J.Bacteriol. 174: 6852-6 [1992]). 

20 The essential nature of the gene product of gcaD is demonstrated by temperature 

sensitive mutants of Bacillus subtilis which are unable to make active enzyme and stop 
growing at the restrictive temperature (Hove-Jensen [1992]). The discovery of the gene 
from the human pathogen Staphylococcus aureus corresponding to gcaD permits one to 
produce NAGPU enzyme which can be used to screen for novel antibiotics as described 

25 below. 

Recently, several novel approaches have been described which purport to follow 
global gene expression during infection (Chuang, S. et al. [1993] Global Regulation of 
Gene Expression in Escherichia coli J. Bacterid. 1 75, 2026-2036, Mahan, MJ. et al. 
[1993] Selection of Bacterial Virulence Genes That Are Specifically Induced in Host 

30 Tissues SCIENCE 259, 686-688, Hensel, M. et al. [1995] Simultaneous Identification of 
Bacterial Virulence Genes by Negative Selection SCIENCE 269, 400-403). These new 
techniques have so far been demonstrated with gram negative pathogen infections and not 
with infections with gram positives presumably due to the much slower development of 
global transposon mutagenesis and suitable vectors needed for these strategies in these 

35 organisms, and in the case of that process described by Chuang. S. et al.[1993] the 

difficulty of isolating suitable quantities of bacterial RNA free of mammalian RNA derived 
from the infected tissue to furnish bacterial RNA labeled to sufficiently high specific 

1 
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activity. The present invention employs a novel technology to determine gene expression 
in the pathogen at different stages of infection of the mammalian host. A novel aspect of 
this invention is the use of a suitably labeled oligonucleotide probe which anneals 
specifically to the bacterial ribosomal RNA in Northern blots of bacterial RNA preparations 
5 from infected tissue. Using the more abundant ribosomal RNA as a hybridization target 
greatly facilitates the optimization of a protocol to purify bacterial RNA of a suitable size 
and quantity for RT-PCR from infected tissue. 

A suitable oligonucleotide useful for applying this method to genes expressed in 
Staphylococcus aureus is, for example, 5*-gctcctaaaaggttactccaccggc-3' [SEQ ID NO:6J. 

10 Use of the technology of the present invention enables identification of bacterial 

genes transcribed during infection, inhibitors of which would have utility in anti-bacterial 
therapy. Specific inhibitors of such gene transcription or of the subsequent translation of 
the resultant mRNA or of the function of the corresponding expressed proteins would have 
utility in anti-bacterial therapy. 

1 5 SUMMARY OF THE INVENTION 

The present invention relates to a novel protein from S. aureus WCUH 29, 
characterized in that it comprises an amino acid sequence selected from the group 
consisting of the amino acid sequence set forth in SEQ ID NO:2 and SEQ ID NO:5, or a 
fragment, analogue or derivative thereof either. This protein and any variants thereof, as 
20 well as polynucleotides encoding the same are herein referred to as "NAGPU." 

The invention also relates to a polypeptide fragment of the protein, having the 
amino acid sequence selected from the group consisting of the amino acid sequence set 
forth in SEQ ID NO:2 and SEQ ID NO:5, or a derivative thereof. 

In accordance with another aspect of the present invention, there are provided 
25 polynucleotides (DNA or RNA) which encode such polypeptides. 

In particular the invention provides a polynucleotide having the DNA sequence 
given in SEQ ID NO: I , and a complementary sequence thereto. 

The invention also provides an isolated polynucleotide comprising a member selected 
from the group consisting of: a polynucleotide having at least a 70% identity to a 
30 polynucleotide encoding a polypeptide comprising amino acids 1 to 450 of SEQ ID NO:2: a 
polynucleotide having at least a 70% identity to a polynucleotide encoding a polypeptide 
comprising amino acids I to 452 of SEQ ID NO:5; a polynucleotide which is complementary 
to the polynucleotide of any such isolated polynucleotide; and a polynucleotide comprising at 
least 1 5 sequential bases of the polynucleotide of any of such isolated polynucleotides. 
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Further provided is a polynucleotide comprising a polynucleotide sequence selected 
from the group consisting of the complementary sequence to nucleotide 54 to 1 406 set forth in 
SEQ ID NO: I. and a complementary sequence to nucleotide 54 to 1412 set forth in SEQ ID 
NO: 1. 

5 In particular the invention provides a polynucleotide which encodes a polypeptide 

comprising amino acid selected from the group consisting of amino acid I to 450 of SEQ ID 
NO:2 and amino acid 1 to 452 of SEQ ID NO:5. 

The invention provides an isolated polynucleotide comprising a member selected 
from the group consisting of: a polynucleotide having at least a 70% identity to a 
1 0 polynucleotide encoding the same mature polypeptide expressed by the DN A contained in 
NCIMB Deposit No. 40794 and having the polynucleotide sequence of SEQ ID NO: ) ; a 
polynucleotide complementary to the polynucleotide of any such isolated polynucleotide; and 
a polynucleotide comprising at least 15 bases of the polynucleotide of such isolated 
polynucleotides. 

1 5 Also provided by the invention is a polypeptide comprising an amino acid sequence 

which is at least 70% identical to amino acid selected from the group consisting of 1 to 450 of 
SEQ ID NO:2 and amino acid 1 to 452 of SEQ ID NO:5. 

Further provided by the invention is a polypeptide comprising an amino acid 
sequence selected from the group consisting of the amino acids as set forth in SEQ ID NO:2 
20 and the amino acids as set forth in SEQ ID NO:5. 

The present invention also provides a novel protein from Staphylococcus, aureus 
WCUH29 obtainable by expression of a gene characterized in that it comprises the DNA 
sequence given SEQ ID NO: 1, or a fragment, analogue or derivative thereof. 

The invention also relates to novel oligonucleotides, including those set forth in 
25 SEQ ID NO: 3, 4 and 6, SEQ ID NO:3 and 4 of which are derived from the polynucleotide 
sequence of SEQ lDNO:l. 

The present invention includes variants of the hereinabove described 
polynucleotides which encode fragments, analogs and derivatives of the polypeptide 
characterized by the deduced amino acid sequence selected from the group consisting of the 
30 amino acid sequence set forth in SEQ ID NO:2 and SEQ ID NO:5. 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 

3 
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Also provided is an antibody aga.nst a polypeptide selected from the group 
consisting of the amino acid sequence set forth in SEQ ID NO:2 and SEQ ID NO:5. Still 
further provided is an antagonist which inhibits the activity of the polypeptide selected from 
the group consisting of the amino acid sequence set forth in SEQ ID NO:2 and SEQ ID 
NO:5. 

A method is also provided for the treatment of an individual having need to inhibit 
novel NAGPU polypeptide comprising: administering to the individual a therapeutically 
effective amount of an antagonist against the polypeptide of the invention. 

Provided is a process for diagnosing a disease related to expression of the polypept.de 
of the invention comprising: determ.ning a nucleic acid sequence encoding the polypeptide 
selected from the group consisting of the amino acid sequence set forth in SEQ ID NO:2 
and SEQ ID NO:5. 

A diagnostic process is provided comprising: analyzing for the presence of the 
polypeptide selected from the group consisting of the ammo acid sequence set forth in SEQ 
1 5 ID NO:2 and SEQ ID NO:5. either in a sample derived from a host. 

In accordance with yet a further aspect of the present invention, there is provided 
the use of a polypeptide of the invention for therapeutic or prophylactic purposes, for 
example, as an antibacterial agent or a vaccine. 

In accordance with another aspect of the present invention, there is provided the 
use of a polynucleotide of the invention for therapeutic or prophylactic purposes, in 
particular genetic immunization. 

In accordance with yet another aspect of the present invention, there are provided 
inhibitors to such polypeptides, useful as antibacterial agents. 

Another aspect of the invention is a pharmaceutical composition comprising the 
above polypeptide, polynucleotide or inhibitor of the invention and a pharmaceutically 
acceptable carrier. 

In a particular aspect the invention provides the use of the polypeptide, 
polynucleotide or inhibitor of the invention to interfere with the immediate phys.cal 
interaction between a pathogen and mammalian host responsible for sequelae of infection. 
30 The invention further relates to the manufacture of a medicament for such uses. 

This invention provides a method of screening drugs to identify those which 
interfere with the interaction of the protein or active fragment to mammalian cells. 



20 
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Further provided is a method for identifying compounds which bind to and inhibit an 
activity of the polypeptide selected from the group consisting of the amino acid sequence set 
forth in SEQ ID NO:2 and SEQ ID NO:5, the method comprising: contacting a cell 
expressing on the surface thereof a binding means for the polypeptide, said binding means 
> be ins associated with a second component capable of providing a detectable signal in 
response to the binding of a compound to said binding means, with a compound to be 
screened under conditions to permit binding to the binding means; and determining whether 
the compound binds to and activates or inhibits the binding by detecting the presence or 
absence of a signal generated from the interaction of the compound with the binding means. 
|i» BRIEF DESCRIPTION OF THE DRAWINGS 

I he following drawings depict certain embodiments of the invention. They are 
illustrative only and do not limit the invention otherwise disclosed herein. 

Figure I i A-C) shows the polynucleotide sequence of novel S. aureus NAGPU gene 
[SEQ ID NO: I ] in the antisense orientation. The complementary sequences of the two start 
1 5 codons are shown in bold and underlined and the complementary sequence of the stop codon 
is underlined. Figure 1 A-C inclusive is referred to herein as "Figure 1" 

Figure 2 shows a polypeptide sequence of a novel NAGPU protein [SEQ ID NO:2] 
deduced from the polynucleotide sequence of Figure 1 [SEQ ID NO: 1]. 

Figure 3 shows the oligonucleotide primers [SEQ ID NO: 3 and 4] derived from the 
20 polynucleotide sequence of Figure I [SEQ ID NO: 1]. 

Figure 4 shows a polypeptide sequence of a novel NAGPU protein [SEQ ID NO:5] 
having an alternate amino terminus deduced from the polynucleotide sequence of Figure 1 
[SEQ ID NO: 1 1. 

DETAILED DESCRIPTION OF THE INVENTION 

25 The present invention relates to a novel N-acetylglucosamine- 1 -phosphate 

uridyltransferase protein from Staphylococcus aureus WCUH29, characterized in that it 
comprises an amino acid sequence selected from the group consisting of that given in SEQ 
ID NO:2 and SEQ ID NO:5, or a fragment, analogue or derivative thereof cither. The 
amino acid sequences of SEQ ID NO: 2 and 5 are two possible translated open reading 

30 frame sequences of SEQ ID NO: 1 and display homology of about 55% identity to the B. 

subtUis NAGPU enzyme sequence. Two preferred NAGPU enzymes of the invention differ 
at their amino terminus, each begining with a different terminal metionine. SEQ ID NO:2 
and 5 show these two forms. 



5 
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S. aureus WCUH 29 has been deposited at the National Collection of Industrial and 
Marine Bacteria Ltd. (NCIMB). Aberdeen. Scotland under number NCIMB 40771 on I 1 
September 1995. 

The invention also relates to a polypeptide fragment of the NAGPU regulator 
protein, having the amino acid sequence selected from the group consisting of SEQ ID 
NO:2 and 5, or a derivative thereof either. 

Hereinafter the term polypeptide^ ) will be used to refer to the NAGPU regulator 
protein, its fragments, analogues or derivatives as wed as the polypeptide fragment or its 
derivatives. 

In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode such polypeptides. 

In particular the invention provides a polynucleotide having the DNA sequence 
given in SEQ ID NO: 1 The invention further provides a polynucleotide encoding a 
NAGPU protein from S. aureus WCUH 29 and characterized in that it comprises the DNA 
1 5 sequence given in SEQ ID NO: 1 . 

The present invention also provides a novel protein from Staphylococcus, aureus 
WCUH29 obtainable by expression of a gene characterized in that it comprises the DNA 
sequence given SEQ ID NO: 1 , or a fragment, analogue or derivative thereof. 

The invention also relates to novel oligonucleotides, including, for example, SEQ 
20 ID NO: 3 and 4, derived from the sequence SEQ ID NO: 1 which can act as PCR primers in 
the process herein described to determine whether or not the Staphylococcus aureus genes 
identified herein in whole or in part are transcribed in infected tissue. The invention also 
relates to novel oligonucleotides, for example. SEQ ID NO: 6, that can act as hybridizations 
probes. It is recognized that such sequences will also have utility in diagnosis of the stage 
25 of infection and type of infection the pathogen has attained. 

The polynucleotide having the DNA sequence given in SEQ ID NO: 1 was obtained 
from the sequencing of a library of clones of chromosomal DNA of S.aureus WCUH 29 in 
E.coli. It has been demonstrated by the process herein described that it is transcribed in 
vivo in an established infection of S.aureus WCUH29 in a mouse model of infection. 
30 To obtain the polynucleotide encoding the protein using the DNA sequence given 

in SEQ ID NO: 1 typically a library of clones of chromosomal DNA of S.aureus WCUH 
29 in E.coli or some other suitable host is probed with a radiolabeled oligonucleotide, 
preferably a 1 7-mer or longer, derived from the partial sequence. Clones carrying DNA 
identical to that of the probe can then be distinguished using high stringency washes. By 
sequencing the individual clones thus identified with sequencing primers designed from the 
original sequence it is then possible to extend the sequence in both directions to determine 
the full gene sequence. Conveniently such sequencing is performed using denatured double 

6 
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stranded DNA prepared from a plasmid clone. Suitable techniques are described by 
Maniatis. T.. Fritsch, E.F. and Sambrook, J. in MOLECULAR CLONING, A Laboratory 
Manual [2nd edition 1989 Cold Spring Harbor Laboratory, see Screening By Hybridization 
1.90 and Sequencing Denatured Double-Stranded DNA Templates 13.70]. 
5 The polynucleotide of the present invention may be in the form of RNA or in the 

form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA 
may be double-stranded or single-stranded, and if single stranded may be the coding strand 
or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide 
may be identical to the coding sequence shown in SEQ ID NO: I or may be a different 

10 coding sequence which coding sequence, as a result of the redundancy or degeneracy of the 
genetic code, encodes the same polypeptide. 

The present invention includes variants of the hereinabove described 
polynucleotides which encode fragments, analogues and derivatives of the polypeptide 
characterized by the deduced amino acid sequence selected from the group consisting of 

1 5 SEQ ID NO:2 and/or 5. The variant of the polynucleotide may be a naturally occurring 
allelic variant of the polynucleotide or a non-naturally occurring variant of either 
polynucleotide. 

Thus, the present invention includes polynucleotides encoding the same 
polypeptide characterized by the deduced amino acid sequence selected from the group 
20 consisting of SEQ ID NO:2 and/or 5, as well as variants of such polynucleotides which 
variants encode for a fragment, derivative or analogue of either polypeptide. Such 
nucleotide variants include deletion variants, substitution variants and addition or insertion 
variants. 

The polynucleotide may have a coding sequence which is a naturally occurring 
25 allelic variant of the coding sequence characterized by the DNA sequence of SEQ ID NO: 1 . 
As known in the art, an allelic variant is an alternate form of a polynucleotide sequence 
which may have a substitution, deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded polypeptide. 

The polynucleotide which encodes for the mature polypeptide, i.e. the N AGPU 
30 protein, may include only the coding sequence for the mature polypeptide or the coding 
sequence for the mature polypeptide and additional coding sequence such as a leader or 
secretory sequence or a proprotein sequence. 

Thus, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
35 polynucleotide which includes additional coding and/or non-coding sequence. 

The present invention therefore includes polynucleotides, wherein the coding 
sequence for the mature polypeptide may be fused in the same reading frame to a 

7 
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polynucleotide sequence which aids in expression and secrct.on of a polypeptide from a 
host cell, for example, a leader sequence which functions as a secretory sequence for 
controlling transport of a polypeptide from the cell. The polypeptide having a leader 
sequence is a prcprotein and may have the leader sequence cleaved by the host cell to form 
the mature form of the polypeptide. The polynucleotides may also encode for a proprotein 
which is the mature protein plus additional 5' amino acid residues. A mature protein having 
a prosequence is a proprotein and is an inactive form of the protein. Once the prosequence 
is cleaved an active mature protein remains. 

Thus, for example, the polynucleotide of the present invention may encode for a 
mature protein, or for a protein having a prosequence or for a protein having both a 
prosequence and a presequence (leader sequence). During post-translational modification 
ot the peptide, a methionine residue at the NH 2 -terminus may be deleted. Accordingly, this 
invention contemplates the use of both the methionine-containing and the methionineless 
amino terminal variants of the protein of the invention. 
1 D The polynucleotides of the present invention may also have the coding sequence 

fused in frame to a marker sequence at either the 5' or 3' terminus of the gene which allows 
for purification of the polypeptide of the present invention. The marker sequence may be a 
hexa-histidine tag supplied by the pQE series of vectors (supplied commercially by 
Quiagen Inc.) to provide for purification of the polypeptide fused to the marker in the case 
20 of a bacterial host. 

The present invention further relates to polynucleotides which hybridize to the 
hereinabove-described sequences if there is at least 50% and preferably at least 70% 
identity between the sequences. The present invention particularly relates to 
polynucleotides which hybridize under stringent conditions to the hereinabove-described 
25 polynucleotides . As herein used, the term "stringent conditions" means hybridization will 
occur only if there is at least 95% and preferably at least 97% identity between the 
sequences. The polynucleotides which hybridize to the hereinabove described 
polynucleotides in a preferred embodiment encode polypeptides which retain substantially 
the same biological function or activity as the polypeptide characterized by the deduced 
30 amino acid sequence selected from the group consisting of SEQ ID NO:2 and/or 5. 

The deposit referred to herein will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of 
Patent Procedure. These deposits are provided merely as convenience to those of skill in 
the art and are not an admission that a deposit is required under 35 U.S.C. § 1 12. The 
sequence of the polynucleotides contained in the deposited material, as well as the amino 
acid sequence of the polypeptides encoded thereby, are incorporated herein by reference 
and are controlling in the event of any conflict with any description of sequences herein. A 

8 
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license mav be required to make, use or sell the deposited material, and no such license is 
hereby granted. 

The terms "fragment," "derivative" and "analogue" when referring to the 
polypeptide characterized by the deduced amino acid sequence selected from the group 
5 consisting of SEQ ID N0.2 and/or 5. means a polypeptide which retains essentially the 
same biological function or activity as such polypeptide. Thus, an analogue includes a 
proprotein which can be activated by cleavage of the proprotein portion to produce an 
active mature polypeptide. 

The polypeptide of the present invention may be a recombinant polypeptide, a 
10 natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide. 

The fragment, derivative or analogue of the polypeptide characterized by the 
deduced amino acid sequence selected from the group consisting of SEQ ID NO:2 and/or 5 
may be (i) one in which one or more of the amino acid residues are substituted with a 
conserved or non-conserved amino acid residue (preferably a conserved amino acid 
1 5 residue) and such substituted amino acid residue may or may not be one encoded by the 
genetic code, or (ii) one in which one or more of the amino acid residues includes a 
substituent group, or (iit) one in which the polypeptide is fused with another compound, 
such as a compound to increase the half-life of the polypeptide (for example, polyethylene 
glycol), or (iv) one in which the additional amino acids are fused to the polypeptide, such as 
20 a leader or secretory sequence or a sequence which is employed for purification of the 
polypeptide or a proprotein sequence. Such fragments, derivatives and analogues are 
deemed to be within the scope of those skilled in the art from the teachings herein. 

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 
25 The term "isolated" means that the material is removed from its original 

environment (e.g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotides could be part of a vector 
30 and/or such polynucleotides or polypeptides could be part of a composition, and still be 
isolated in that such vector or composition is not part of its natural environment. 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 
35 In accordance with yet a further aspect of the present invention, there is therefore 

provided a process for producing the polypeptide of the invention by recombinant 
techniques by expressing a polynucleotide encoding said polypeptide in a host and 

9 



BNSDOCID: <WO 9721819A1 J_> 



WO 97/21819 



PCT/GB96/03047 



recovering the expressed product. Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

Host cells are genetically engineered (transduced or transformed or transfectcd) 
with the vectors of this invention which may be. for example, a cloning vector or an 
5 expression vector. The vector may be, for example, in the form of a plasmid, a cosmid. a 
phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying the 
genes. The culture conditions, such as temperature. pH and the like, are those previously 
used with the host cell selected for expression, and will be apparent to the ordinarily skilled 
1 0 artisan. 

Suitable expression vectors include chromosomal, nonchromosomal and synthetic 
DNA sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

1 5 The appropriate DNA sequence may be inserted into the vector by a variety of 

procedures. In general, the DNA sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 

20 examples of such promoters, there may be mentioned: LTR or SV40 promoter, the £. colt, 
lac or trp, the phage lambda Pr^ promoter and other promoters known to control expression 
of genes in eukaryotic or prokaryotic cells or their viruses. The expression vector also 
contains a ribosome binding site for translation initiation and a transcription terminator. 
The vector may also include appropriate sequences for amplifying expression. 

25 In addition, the expression vectors preferably contain one or more selectable 

marker genes to provide a phenotypic trait for selection of transformed host cells such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. coli. 

The gene can be placed under the control of a promoter, ribosome binding site (for 

30 bacterial expression) and. optionally, an operator (collectively referred to herein as 

"control" elements), so that the DNA sequence encoding the desired protein is transcribed 
into RNA in the host cell transformed by a vector containing this expression construction. 
The coding sequence may or may not contain a signal peptide or leader sequence. The 
polypeptides of the present invention can be expressed using, for example, the £. colt tac 

35 promoter or the protein A gene (spa) promoter and signal sequence. Leader sequences can 
be removed by the bacterial host in post-translational processing. See, e.g., U.S. Patent 
Nos. 4.431,739; 4,425,437; 4,338,397. Promoter regions can be selected from any desired 
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»enc using CAT (chloramphenicol transferase) vectors or other vectors with selectable 
markers. Two appropriate vectors are PKK232-8 and PCM 7. Particular named bacterial 
promoters include lacl, lacZ, T3. T7, gpt, lambda Pr, Pl and trp. Eukaryolic promoters 
include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from 
5 retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and promoter is 
well within the level of ordinary skill in the art. 

In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
of the host cell. Regulatory sequences are known to those of skill in the an. and examples 
1 0 include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
sequences. 

An expression vector is constructed so that the particular coding sequence is 

1 5 located in the vector with the appropriate regulatory sequences, the positioning and 

orientation of the coding sequence with respect to the control sequences being such that the 
coding sequence is transcribed under the "control" of the control sequences (i.e.. RNA 
polymerase which binds to the DNA molecule at the control sequences transcribes the 
codins sequence). Modification of the coding sequences may be desirable to achieve this 

20 end. For example, in some cases it may be necessary to modify the sequence so that it may 
be attached to the control sequences with the appropriate orientation; i.e., to maintain the 
reading frame. The control sequences and other regulatory sequences may be ligated to the 
coding sequence prior to insertion into a vector, such as the cloning vectors described 
above. Alternatively, the coding sequence can be cloned directly into an expression vector 

25 which already contains the control sequences and an appropriate restriction site. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly- 
expressed gene to direct transcription of a downstream structural sequence. The 

30 heterologous structural sequence is assembled in appropriate phase with translation 

initiation and termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N-terminai 
identification peptide imparting desired characteristics, e.g.. stabilization or simplified 

35 purification of expressed recombinant product. 
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The vector containing the appropriate DNA sequence as hereinabove described as 
well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. 

More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into wh.ch a sequence of the invention 
has been mserted, in a forward or reverse orientation. In a preferred aspect of this 
embodiment, the construct further comprises regulatory sequences, including for example 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art. and are commercially available The 
following vectors are provided by way of example. Bacterial: P ET-3 vectors (Stratagene) 

POE70. P QE60. oOE-9 (Qiagen). pbs, P D10, phagescript, psiX174, pbluescript SK pbsks' 

PNH8A. pNH16a. pNH18A. pNH46A (Stratagene); ptrc99a. pKK223-3. pKK233-3. 

PDR540. P RIT5 (Pharmacia). Eukarvotic: pBlueBacIII (Invitrogen), pWLNEO. 

P-SV2CAT, pOG44, P XT1, pSG (Stratagene) pSVK3. pBPV, P MSG. pSVL (Pharmacia). 

However, any other piasmid or vector may be used as long as they are replicable and viable 

in the host. 

Examples of recombinant DNA vectors for cloning and host cells which thev can 
transform include the bacteriophage \ (£. coli), pBR322 (£. coli), pACYC177 (£. coli), 
PKT230 (gram-negative bacteria), pGVl 106 (gram-negative bacteria), pLAFRI (cram- 
negative bacteria). P ME290 (non-£. coli gram-negative bacteria), P HV14 (£. col, and 
Bacillus subtil,,), pBD9 (Bacillus), plJ6I (Streptomyces), pUC6 (Streptomyces), YIp5 
(Saccharomvces), a baculovirus insect cell system, , YCpl9 (Saccharomvces) See 
generally, "DNA Cloning": Vols. I & II, Glover et al. ed. IRL Press Oxford ( 1985) d 987) 
and: T. Maniatis et al. ("Molecular Cloning" Cold Spring Harbor Laboratory ( 1 982). 

In some cases, .t may be desirable to add sequences which cause the secret.on of 
the polypeptide from the host organism, with subsequent cleavage of the secretory signal. 

Polypeptides can be expressed in host cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins 
usmg RNAs derived from the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with prokaryotic and eukarvotic hosts are described 
by Sambrook. et al.. Molecular Cloning. A Laboratory Manual. Second Edition Cold 
Spring Harbor. N.Y.. (1989), the disclosure of which is hereby incorporated by reference. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.s., 
temperature shift or chemical induction) and cells are cultured for an additional period. 
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Cells are typically harvested by centrifugation. disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or 
5 use of cell lysing agents, such methods are well known to those skilled in the art. 

Depending on the expression system and host selected, the polypeptide of the 
present invention may be produced by growing host cells transformed by an expression 
vector described above under conditions whereby the polypeptide of interest is expressed. 
The polypeptide is then isolated from the host cells and purified, [f the expression system 
10 secretes the polypeptide into growth media, the polypeptide can be purified directly from 
the media. If the polypeptide is not secreted, it is isolated from cell iysates or recovered 
from the cell membrane fraction. Where the polypeptide is localized to the cell surface, 
whole cells or isolated membranes can be used as an assayable source of the desired gene 
product. Polypeptide expressed in bacterial hosts such as E. coli may require isolation from 
] 5 inclusion bodies and refolding. Where the mature protein has a very hydrophobic region 
which leads to an insoluble product of overexpression, it may be desirable to express a 
truncated protein in which the hydrophobic region has been deleted. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by 
20 methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
25 (HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 
A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that 
30 functions as an autonomous unit of DNA replication in vivo; i.e., capable of replication 
under its own control. 

A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another 
DNA segment may be attached so as to bring about the replication of the attached segment. 
A "double-stranded DNA molecule" refers to the polymeric form of 
35 dcoxyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded 
helix, both relaxed and supercoiled. This term refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 
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term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g., 
restriction fragments), viruses, piasmids, and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein according 
to the normal convention of giving only the sequence in the 5' to 3' direction along the 
5 nontranscribed strand of DNA (i.e., the strand having the sequence homologous to the 
mRNA). 

A DNA "coding sequence of or a "nucleotide sequence encoding" a particular 
protein. is a DNA sequence which is transcribed and translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. 

10 A "promoter sequence" is a DNA regulatory region capable of binding RNA 

polymerase in a cell and initiating transcription of a downstream (3' direction) coding 
sequence. For purposes of defining the present invention, the promoter sequence is bound 
at the 3* terminus by a translation start codon (e.g., ATG) of a coding sequence and extends 
upstream (5' direction) to include the minimum number of bases or elements necessary' to 

1 5 initiate transcription at levels detectable above background. Within the promoter sequence 
will be found a transcription initiation site (conveniently defined by mapping with nuclease 
SI), as well as protein binding domains (consensus sequences) responsible for the binding 
of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" 
boxes and "CAT" boxes. Prokaryotic promoters contain Shine-Dalgarno sequences in 

20 addition to the -10 and -35 consensus sequences. 

DNA "control sequences" refers collectively to promoter sequences, ribosome 
binding sites, polyadenylation signals, transcription termination sequences, upstream 
regulatory domains, enhancers, and the like, which collectively provide for the expression 
( i.e., the transcription and translation) of a coding sequence in a host cell. 

25 A control sequence "directs the expression" of a coding sequence in a cell when 

RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
mRNA, which is then translated into the polypeptide encoded by the coding sequence. 

A "host cell" is a cell which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous DNA sequence. 

50 A ceil has been "transformed" by exogenous DNA when such exogenous DNA has 

been introduced inside the cell membrane. Exogenous DNA may or may not be integrated 
(covaiently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an 
episomal element, such as a plasmid. With respect to eukaryotic cells, a stably transformed 

35 or transfected cell is one in which the exogenous DNA has become integrated into the 

chromosome so that it is inherited by daughter cells through chromosome replication. This 
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stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cells containing the exogenous DNA. 

A "clone 1 ' is a population of cells derived from a single cell or common ancestor by 
mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro 
5 for many generations. 

A "heterologous" region of a DNA construct is an identifiable segment of DNA 
within or attached to another DNA molecule that is not found in association with the other 
molecule in nature. 

Each of the DNA sequences provided herein may be used in the discovery and 
10 development of antibacterial compounds. The encoded protein upon expression can be 

used as a target for the screening of antibacterial drugs. Additionally, the DNA sequences 
encoding the amino terminal regions of the encoded protein or Shine-Delgarno or other 
translation facilitating sequences of the respective mRNA can be used to construct 
antisense sequences to control the expression of the coding sequence of interest. 
1 5 in accordance with yet another aspect of the present invention, there are provided 

inhibitors to such polypeptides, useful as antibacterial agents. Inhibitors of NAGPU 
proteins have utility in anti-bacterial therapy. 

Another aspect of the invention is a pharmaceutical composition comprising an 
inhibitor of the invention and a pharmaceutical ly acceptable carrier. 
20 In a particular aspect the invention provides the use of an inhibitor of the invention 

as an antibacterial agent. 

The invention further relates to the manufacture of a medicament for such uses. 
This invention provides a method of screening drugs to identify those which are 
antibacterial by measuring the ability of the drug to interfere with the biosynthesis of 
25 uridvl diphosphate N-acetyl glucosamine by the NAGPU protein. 

It has been shown that E.coli NAGPU will act as a pyrophosphorylase, catalyzing 
the reverse reaction to N-acetyl glucosamine- 1 -phosphate from the products of the forward 
reaction, UDP-N-acetyl glucosamine and pyrophosphate (Strominger, J.R. and Smith, M.S. 
[1959] J. Biol. Chem. 234: 1822-7). By introducing an inorganic pyrophosphatase into the 
30 reaction it will proceed in the forward direction without limit (Mengin-Lecreulx. D. and 
van Heijenoort, J , J. BacterioL 176: 5788-5795 [1994]). 

In a preferred embodiment, N-acetylgiucosamine- 1 -phosphate is incubated with 
UTP and inorganic pyrophosphatase in the presence of the NAGPU protein to generate 
inonzanic phosphate which can be measured colorimetrically using a suitably sensitive 
35 procedure such as the Malachite Green method (ltaya,K. & ULM. Clin. Chim. Acta 14,361- 
366 [1966) to provide a measurement of NAGPU enzymatic activity. The decrease of 
enzymatic activity in this reaction would indicate the presence of an inhibitor. 
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The invention also relates to inhibitors identified thereby. 

In therapy or as a prophylactic, the active agent may be administered to a patient as 
an injectable composition, for example as a sterile aqueous dispersion, preferably isotonic. 
Alternatively the composition may be formulated for topical application 
5 for example in the form of ointments, creams, lotions, eye ointments, eve drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate 
conventional additives, including, for example, preservatives, solvents to assist drug 
penetration, and emollients in ointments and creams. Such topical formulations may also 
contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
10 or oleyl alcohol for lotions. Such carriers may constitute from about 1% to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the formulation. 

For administration to human patients, it is expected that the daily dosage level of 
the active agent will be from 0.0 1 to 10 mg/kg, typically around 1 mg/kg. The physician in 
1 5 any event will determine the actual dosage which will be most suitable for an individual 

patient and will vary with the age, weight and response of the particular patient. The above 
dosages are exemplary of the average case. There can, of course, be individual instances 
where higher or lower dosage ranges are merited, and such are within the scope of this 
invention. 

20 Within the indicated dosage range, no adverse toxicoiogical effects are expected 

with the compounds of the invention which would preclude their administration to suitable 

patients. 

Examples 

The present invention is further described by the following examples. These 
25 exemplification's, while illustrating certain specific aspects of the invention, do not portray 
the limitations or circumscribe the scope of the disclosed invention. 

In order to facilitate understanding of the following examples certain frequently 
occurring methods and/or terms will be described. 

"Plasm ids" are designated by a lower case p preceded and/or followed by capital 
30 letters and/or numbers. The starting plasmids herein are either commercially available, 

publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures. In addition, equivalent plasmids to those described are 
known in the an and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 
35 enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and their reaction conditions, cofactors and other 
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requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically I ug of plasmid or DNA fragment is used with about 2 units of enzyme 
in about 20 ul of buffer solution. For the purpose of isolating DNA fragments for plasmid 
construction, typically 5 to 50 ug of DNA arc digested with 20 to 250 units of enzyme in a 
5 lamer volume. Appropriate buffers and substrate amounts for particular restriction 

enzvmes are specified by the manufacturer. Incubation times of about 1 hour at 37° C are 
ordinarily used, but may vary in accordance with the supplier's instructions. After digestion 
the reaction is clcctrophoresed directly on a polyacrylamide gel to isolate the desired 
fragment. 

$j Z e separation of the cleaved fragments is performed using 8 percent 

pol\acr\lamide gel described by Goeddel, D. et a/.. Nucleic Acids Res., 8:4057 (1980). 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 

cornplementarv polydeoxynucleotide strands which may be chemically synthesized. Such 

svnthetic oiieonucleotides have no 5' phosphate and thus will not ligate to another 

1 5 oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 

>vnthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

"Ligation" refers to the process of forming phosphodiester bonds between two 

double stranded nucleic acid fragments (Maniatis, T., et al.. Id., p. 146). Unless otherwise 

provided, ligation may be accomplished using known buffers and conditions with 10 units 

20 to T4 DNA ligase ("ligase") per 0.5 ug of approximately equimolar amounts of the DNA 

fragments to be ligated. 

All examples were carried out using standard techniques, which are well known and 

routine to those of skill in the art, except where otherwise described in detail. Routine 

molecular biology techniques of the following examples can be carried out as described in 

25 standard laboratory manuals, such as Sambrook et al., MOLECULAR CLONING: A 

LABORATORY iVIANUAL. 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor. N Y. (1989). 

Example 1 

Isolation of DNA coding for a NAGPU Protein from 

30 S. Aureus VVCUH 29 

The polynucleotide having the DNA sequence given in SEQ ID NO.l was obtained 
from a library of clones of chromosomal DNA of S.aureus WCUH 29 in E.coli. In some 
cases the sequencing data from two or more clones containing overlapping S.aureus 
WCUH 29 DNA was used to construct the contiguous DNA sequence in SEQ ID NO: I 

35 Libraries may be prepared by routine methods, for example: 
Methods 1 and 2 
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Total cellular DNA is isolated from Staphylococcus aureus strain VVCUH29 
(NCIMB 40771) according to standard procedures and size-fractionated by either of two 
methods. 
Method I. 

5 Total cellular DNA is mechanically sheared by passage through a needle in order to 

size-fractionate according to standard procedures. DNA fragments of up to I Ikbp in size 
are rendered blunt by treatment with exonucleasc and DNA polymerase, and EcoRI linkers 
added. Fragments are ligated into the vector Lambda Zapll that has been cut with EcoRI, 
the library packaged by standard procedures and E.coli infected with the packaged library. 
1 0 The library is amplified by standard procedures. 
Method 2. 

Total cellular DNA is partially hydrolyzed with a combination of four restriction 
enzymes (Rsal. Pall. Alui and Bsh 12351) and size-fractionated according to standard 
procedures. EcoRI linkers are ligated to the DNA and the fragments then ligated into the 
1 5 vector Lambda Zapll that have been cut with EcoRI, the library packaged by standard 
procedures, and E.coli infected with the packaged library. The library is amplified by 
standard procedures. 
Example 2 

The determination of expression during infection of a gene from Staphylococcus 
20 aureus WCUH29 

Necrotic fatty tissue from a four day groin infection of Staphylococcus aureus 
WCUH29 in the mouse is efficiently disrupted and processed in the presence of chaotropic 
agents and RNAase inhibitor to provide a mixture of animal and bacterial RNA. The 
optimal conditions for disruption and processing to give stable preparations and high yields 

25 of bacterial RNA are followed by the use of hybridization to a radiolabeled oligonucleotide 
specific to Staphylococcus aureus 16S RNA on Northern blots. The RNAase free. DNAase 
free, DNA and protein free preparations of RNA obtained are suitable for Reverse 
Transcription PCR (RT-PCR) using unique primer pairs designed from the sequence of 
each gene of Staphylococcus aureus WCUH29. 

30 a) Isolation of tissue infected with Staphylococcus aureus WCUH29 from a mouse 
animal model of infection 

10 ml. volumes of sterile nutrient broth (No.2 Oxoid) are seeded with isolated, 
individual colonies of Staphylococcus aureus WCUH29 from an agar culture plate. The 
cultures are incubated aerobically (static culture) at 37 degrees C for 16-20 hours . 4 week 

35 old mice (female. 1 8g-22g. strain MFI) are each infected by subcutaneous injection of 
0.5ml. of this broth culture of Staphylococcus aureus WCUH29 (diluted in broth to 
approximately I0 s cfu/ml.) into the anterior . right lower quadrant (groin area). Mice should 
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be monitored regularly during the first 24 hours after infection, then daily until termination 
of study. Animals with signs of systemic infection, i.e. lethargy, ruffled appearance, 
isolation from group, should be monitored closely and if signs progress to moribundancy, 
the animal should be culled immediately. 
5 Visible external signs of lesion development will be seen 24-48h after infection. 

Examination of the abdomen of the animal will show the raised outline of the abscess 
beneath the skin. The localized lesion should remain in the right lower quadrant, but may 
occasionally spread to the left lower quadrant, and superiorly to the thorax. On occasions, 
the abscess may rupture through the overlying skin layers. In such cases the affected 
10 animal should be culled immediately and the tissues sampled if possible. Failure to cull the 
animal may result in the necrotic skin tissue overlying the abscess being sloughed off, 
exposing the abdominal muscle wall. 

Approximately 96h after infection, animals are killed using carbon dioxide 
asphyxiation. To minimize delay between death and tissue processing /storage, mice 
1 5 should be killed individually rather than in groups. The dead animal is placed onto its back 
and the fur swabbed liberally with 70% alcohol. An initial incision using scissors is made 
through the skin of the abdominal left lower quadrant, traveling superiorly up to, then 
across the thorax. The incision is completed by cutting inferior ly to the abdominal lower 
right quadrant. Care should be taken not to penetrate the abdominal wall. Holding the skin 
20 flap with forceps, the skin is gently pulled way from the abdomen. The exposed abscess, 
which covers the peritoneal wall but generally does not penetrate the muscle sheet 
completely, is excised, taking care not to puncture the viscera 

The abscess/muscle sheet and other infected tissue may require cutting in sections, 
prior to flash-freezing in liquid nitrogen, thereby allowing easier storage in plastic 
25 collecting vials. 

b) Isolation of Staphylococcus aureus WCUH29 RNA from infected tissue samples 

4-6 infected tissue samples(each approximately 0.5-0.7g) in 2ml screw-cap tubes 
are removed from -80°C storage into a dry ice ethanol bath In a microbiological safety 
cabinet the samples are disrupted individually whilst the remaining samples are kept cold in 
30 the dry ice ethanol bath. To disrupt the bacteria within the tissue sample I ml of TRlzol 
Reagent (Gibco BRL, Life Technologies) is added followed by enough 0.1mm 
zirconia/silica beads to almost fill the tube, the lid is replaced taking care not to get any 
beads into the screw thread so as to ensure a good seal and eliminate aerosol generation. 
The sample is then homogenized in a Mini-BeadBeater Type BX-4 (Biospec Products). 
35 Necrotic fatty tissue is treated for 100 seconds at 5000 rpm in order to achieve bacterial 
lysis. In vivo grown bacteria require longer treatment than in vino grown S. aureus 
WCUH29 which are disrupted by a 30 second bead-beat. 
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After bead-beating the tubes are chilled on ice before opening in a fume-hood as 
heat generated during disruption may degrade the TRIzol and release cyanide. 

200 microiitres of chloroform is then added and the tubes shaken by hand for 15 
seconds to ensure complete mixing. After 2-3 minutes at room temperature the tubes are 
spun down at 1 2,000 x g, 4 °C for 1 5 minutes and RNA extraction is then continued 
according to the method given by the manufacturers of TRIzol Reagent i.e.:- The aqueous 
phase, approximately 0.6 ml, is transferred to a sterile Eppendorf tube and 0.5 ml of 
isopropanoi is added. After 10 minutes at room temperature the samples are spun at 12,000 
\ g. 4 °C for 10 minutes. The supernatant is removed and discarded then the RNA pellet is 
washed with 1 ml 75% ethanol. A brief vortex is used to mix the sample before 
.entr.fug.ng at 7.500 x g, 4 °C for 5 minutes. The ethanol is removed and the RNA pellet 
dried under vacuum for no more than 5 minutes. Samples are then resuspended by repeated 
pipetting in !00 microiitres of DEPC treated water, followed by 5-10 minutes at 55 °C. 
f inally, after at least 1 minute on ice, 200 units of Rnasin (Promega) is added. 
15 RNA preparations are stored at -80 °C for up to one month. For longer term storage 

the RNA precipitate can be stored at the wash stage of the protocol in 75% ethanol for at 
least one year at -20 °C. 

Quality of the RNA isolated is assessed by running samples on 1% agarose gels. I 
x TBE gels stained with ethidium bromide are used to visualize total RNA yields. To 
0 demonstrate the isolation of bacterial RNA from the infected tissue I x MOPS. 2.2M 

formaldehyde gels are run and vacuum blotted to Hybond-N (Amersham). The blot is then 
hybridized with a 32 P labeled oligonucletide probe specific to 16s rRNA of S.aureus 
(K.Greisen. M. Loeffelholz, A. Purohit and D. Leong. J.Clin. (1994) Microbiol. 32 335- 
351). An oligonucleotide of the sequence: S'-gctcctaaaaggttactccaccggc^' [SEQ ID NO:6] 
5 is used as a probe. The size of the hybridizing band is compared to that of control RNA 
isolated from in vitro grown S.aureus WCUH29 in the Northern blot. Correct sized 
bacterial 16s rRNA bands can be detected in total RNA samples which show extensive 
degradation of the mammalian RNA when visualized on TBE gels, 
c) The removal of DNA from Staphylococcus aureus WCUH29 derived RNA 
0 DNA was removed from 73 microlitre samples of RNA by a 1 5 minute treatment 

on ice with 3 units of DNAasel. amplification grade (Gibco BRL, Life Technologies) in the 
buffer supplied with the addition of 200 units of Rnasin (Promega) in a final volume of 90 
microiitres. 

The DNAase was inactivated and removed by treatment with TRIzol LS Reagent 
5 (Gibco BRL. Life Technologies) according to the manufacturers protocol. 

DNAase treated RNA was resuspended in 73 microiitres of DEPC treated water with the 
addition of Rnasin as described in Method 1 . 
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d) The preparation of cDNA from RNA samples derived from infected tissue 

10 microlitre samples of DNAasc treated RNA are reverse transcribed using a 
Superscript Preamplification System for First Strand cDNA Synthesis kit (Gibco BRL, Life 
Technologies) according to the manufacturers instructions. ) nanogram of random 
5 hexamers is used to prime each reaction. Controls without the addition of SuperScriptH 
reverse transcriptase are also run. Both +/-RT samples are treated with RNaseH before 
proceeding to the PCR reaction 

c) The use of PCR to determine the presence of a bacterial cDNA species 
PCR reactions are set up on ice in 0.2ml tubes by adding the following 
10 components: 

45 microiitres PCR SUPERM1X (Gibco BRL, Life Technologies). 
1 microlitre 50mM MgCl 2 , to adjust final concentration to 2.5mM. 

1 microlitre PCR primers(optimally 18-25 basepairs in length and 
designed to possess similar annealing temperatures), each primer at 

1 S I OmM initial concentration. 

2 microiitres cDNA. 

PCR reactions are run on a Perkin Elmer GeneAmp PCR System 9600 as follows: 
5 minutes at 95 °C, then 50 cycles of 30 seconds each at 94 °C, 42 °C and 
72 °C followed by 3 minutes at 72 °C and then a hold temperature of 4 °C. (the number of 
20 cycles is optimally 30-50 to determine the appearance or lack of a PCR product and 
optimally 8-30 cycles if an estimation of the starting quantity of cDN A from the RT 
reaction is to be made). 

10 microlitre aliquots are then run out on 1% 1 x TBE gels stained with ethtdium 
bromide with PCR product, if present, sizes estimated by comparison to a 100 bp DNA 
25 Ladder (Gibco BRL, Life Technologies). Alternatively if the PCR products are 

conveniently labeled by the use of a labelled PCR primer (e.g. labelled at the 5'end with a 
dye) a suitable aliquot of the PCR product is run out on a polyacrylamide sequencing gel 
and its presence and quantity detected using a suitable gel scanning system (e.g. ABI 
Prism™ 377 Sequencer using GeneScan™ software as supplied by Perkin Elmer) 
30 RT/PCR controls may include +/- reverse transcriptase reactions, 16s rRNA 

primers or DNA specific primer pairs designed to produce PCR products from non- 
transcribed S. aureus WCUH29 genomic sequences. 

To test the efficiency of the primer pairs they are used in DNA PCR with WCUH29 
total DNA. PCR reactions are set up and run as described above using approx. I microgram 
35 of DNA in place of thccDNA and 35 cycles of PCR. 
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Primer pairs which fail to give the predicted sized product in either DNA PCR or 
RT/PCR are PCR failures and as such are uninformative. Of those which give the correct 
size product with DNA PCR two classes are distinguished in RT/PCR: 

1 Genes which arc not transcribed in vivo reproducibly fail to cive a product in 
5 RT/PCR. 

Z.Genes which are transcribed in vivo reproducibly give the correct size product in 
RT/PCR and show a stronger signal in the +RT samples than the signal (if at all present) in 
-RT controls. 

The following nucleotide sequence (SEQ ID NO: I ) was .dentifled in the above test 
1 0 as transcribed in vivo Deduced amino acid sequences are given as SEO ID NO:2 and 5. 
The pair of PCR primers used to identify the gene are given as SEQ ID NOs 3 and 4. 
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SEQUENCE LISTING 

(1} GENERAL INFORMATION 
(i) APPLICANT: SmithKline Beecham p. I.e. 

5 

(li) TITLE OF THE INVENTION: NOVEL NAPU 

(iii) NUMBER OF SEQUENCES: 6 

10 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 
<B) STREET: 709 Swedeland Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

15 (E) COUNTRY: U.S.A. 

(F) ZIP: 19406-0939 

{v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

20 (B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

<vi) CURRENT APPLICATION DATA: 
25 (A) APPLICATION NUMBER: 

(B) FILING DATE: 05-DEC-1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 
30 (A) APPLICATION NUMBER: 60/027972 

(Bi FILING DATE: 08-OCT-1996 

(A) APPLICATION NUMBER: 9604580.2 

(B) FILING DATE: 04-MAR-1996 



35 



(A) APPLICATION NUMBER: 9612887.1 

(B) FILING DATE: 26-MAY-1996 



(A) APPLICATION NUMBER: 9604581.0 

40 (B) FILING DATE: 04-MAR-1996 

(A) APPLICATION NUMBER: 9525246.6 

(3) FILING DATE: ll-DEC-1995 
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(viii) ATTORNEY /AGENT INFORMATION : 

(A) NAME: Gimmi, Edward R 

(B) REGISTRATION NUMBER: 38,391 

(C) REFERENCE/ DOCKET NUMBER: P31302 

5 

{xx) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 

10 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1449 base pairs 

<B) TYPE: nucleic acxd 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
{v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

25 



30 



35 



40 



txi) 


SEQUENCE DESCRIPTION : 


SEQ ID NO: 


1 : 






AATATTACAT 


AGGATTATCC 


TTTTATCCTA 


GCCATTTTAA 


ATACGTAAAT 


GATTATTTCC 


60 


TATATCCTTC 


TTTTGTTGTT 


TGTCTTGCTC 


TTGCCACAGC 


TAAACTGTCA 


TTTGGTACGT 


120 


CATCTGTGAT 


TGTGGAACCA 


GCTGCCACCA 


ATACATCATC 


ACCAATTGTT 


ACAGGTGCTA 


180 


CTAAATTAAC 


ATTGCAACCT 


ACAAATGAAT 


CTTTGCCGAC 


GATAGTTTTA 


AATTTATTTT 


240 


CACCATCATA 


GTTAACTGTA 


ATCGTTCCGC 


AACCAATATT 


AGTACGTTCG 


CCAATTACAG 


300 


CATCGCCAAT 


ATAACTTAAA 


TGTGAAACCT 


TGGCACCATC 


TTTAAGATCT 


GCTTTTTTAA 


360 


TTTCTACAAA 


ATTTCCAACC 


TTAACATCTG 


CACCTAATTG 


CGCGCCTGGT 


CTCAATTGCG 


420 


CAAACGGTCC 


GACCTTAGTA 


TTAGCTCCTA 


CGCTAGCATC 


ATTAACAACA 


GACTGTTGAA 


480 


7ACATGCACC 


ATTTTCAATC 


GTACTATTGT 


TAATTTCAGA 


GTACTGACCA 


ATAACAACAT 


540 


CTTCGCCAAT 


TTCTGTACGA 


CCATTAATTC 


GTACGCCTGG 


TTCAATGACT 


GTATCACTAC 


600 


CAATTGTAAC 


GTCTGGACCA 


ATAAAAGTGC 


TGTCAGGATC 


GATGATTGTC 


ACACCATTTA 


660 


GCATGTGATA 


ATGATTCGTA 


CGACGTTGCA 


TCGCCTTCTC 


AGCCTGACTA 


AGCATTACAC 


720 


gatcatttac 


ACCCATGATT 


TCTTCAACAT 


CATTGGTACG 


ATACACTTCT 


ACGATGCCGC 


780 


CATCATTTAA 


AATTAACGAC 


AATACATCAG 


GGAGGTAATA 


TTCACCTTGC 


G CAT TAT CAT 


840 


TTTTCACTTG 


TGTTAATTTT 


TCAAACAACG 


•yTTj ATT * T T 


AAAC G C AAAA 


ATACCTGAAC 


900 


TAATTTCATT 


AATATCCTTT 


TCAGCTTGCG 


TTGCATCTTT 


CTCTTCAACT 


ATGCGTTCTA 


960 


AACGACCTGA 


CGCATTTCGA 


ACGATTCTTC 


CGTATCCATA 


TGGTTGTTGA 


ATCGA7GCAG 


1020 
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10 



25 



AT AAT AC AG T TGCTTGAGCA TTAGCATCCT CATGATGCGC AATCAATGTT TCTAATGTTT 1080 

CCTTTGTGAT GAGCGGTGTG TCACCACATA CAACGATTGT CGTACCTTCC TTGTCTTCTA 1140 

AGTGTGATTT CGCCATTTGC ACTGCATGCG CAGTACCGAG TTGTTCCTCT TGAAAACTGT 12 00 

ATAAAGAACG CTCGCCTAAA TGTCCTTTTA CACTTTCAGC ACCATGTCCT ACGATGGTTA 1260 

CAACTTGATC GACACCAGAG CCTTTCACAC TTTCCAATAC ATGTTCGACC ATAGGTTTCC 1320 

CAGCAACCTC GTGTAGCACT TTATACTTTT TAGATTTCAT TCTTGTGCCT TTACCTGCTG 1380 
C C AAAAT TAT CGCGTGTCTT CGCATGAACA TTAACCCCCA TTAAATTTTA CACTTCGTCA 
TTATTATAA 

(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 50 amino acids 

(B) TYPE: amino acid 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 
20 (iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Arg Arg His Ala He He Leu Ala Ala Gly Lys Giy Thr Arg Met 

1 5 10 1- 

Lys Ser Lys Lys Tyr Lys Val Leu His Glu Val Ala Gly Lys Pro Met 
20 25 30 

30 Val Glu His Val Leu Glu Ser Val Lys Gly Ser Gly Val Asp Gin Val 
3S 40 45 

Val Thr He Val Gly His Gly Ala Glu Ser Val Lys Gly His Leu Gly 

50 55 60 

Glu Arg Ser Leu Tyr Ser Phe Gin Glu Glu Gin Leu Gly Thr Ala His 
35 65 70 75 80 

Ala Val Gin Met Ala Lys Ser His Leu Glu Asp Lys Glu Gly Thr Thr 

85 90 95 

He Val Val Cys Gly Asp Thr Pro Leu He Thr Lys Glu Thr Leu Glu 
100 105 HO 

40 Thr Leu He Ala His His Glu Asp Ala Asn Ala Gin Ala Thr Val Leu 
115 120 125 

Ser Ala Ser He Gin Gin Pro Tyr Gly Tyr Giy Arg lie Val Arg Asn 
130 135 1^0 



1440 
1449 
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15 



Ala Ser Gly Arg Leu Glu Arg He Val Glu Glu Lvs Asp Ala Thr Gin 
145 150 155 iso 

Ala Glu Lys Asp lie Asn Glu lie Ser Ser Gly lie Phe Ala Phe Asn 
165 170 17', 

5 Asn Lys Thr Leu Phe Glu Lys Leu Thr Gin Val Lys Asn Asp Asn Ala 
130 185 190 

Gin Gly Glu Tyr Tyr Leu Pro Asp Val Leu Ser Leu He Leu Asn Asp 

195 200 205 

Gly Gly He Val Glu Val Tyr Arg Thr Asn Asp Val Glu Glu lie Met 
10 210 215 220 

Gly Val Asn Asp Arg Val Met Leu Ser Gin Ala Giu Lys Ala Met Gin 
225 23° 235 240 

Arg Arg Thr Asn His Tyr His Met Leu Asn Gly Val Thr He He Asp 

245 250 255 

Pro Asp Ser Thr Phe lie Gly Pro Asp Val Thr He Gly Ser Asp Thr 

250 265 270 

Val He Glu Pro Gly Val Arg He Asn Gly Arg Thr Glu He Giy Glu 

275 280 285 

Asp Val Val lie Gly Gin Tyr Ser Glu He Asn Asn Ser Thr He Glu 
20 290 295 300 

Asn Gly Ala Cys He Gin Gin Ser Val Val Asn Asd Ala Ser Val Gly 
305 310 315 320 

Ala Asn Thr Lys Val Gly Pro Phe Ala Gin Leu Arg Pro Gly Ala Gin 
325 330 335 

25 Leu Gly Ala Asp Val Lys Val Gly Asn Phe Val Giu He Lys Lys Ala 

340 345 350 

Asp Leu Lys Asp Gly Ala Lys Val Ser His Leu Ser Tyr He Gly Asp 

355 360 3 6 5 

Ala Val He Gly Glu Arg Thr Asn He Gly Cys Gly Thr He Thr Val 
30 370 375 380 

Asn Tyr Asp Giy Glu Asn Lys Phe Lys Thr He Vai Gly Lys Asp Ser 
385 3 90 395 400 

Phe Val Gly Cys Asn Val Asn Leu Val Ala Pro Val Thr He Gly Asp 
405 410 415 

35 Asp Val Leu Val Ala Ala Gly Ser Thr He Thr Asp Asp Val Pro Asn 

420 425 430 

Asp Ser Leu Ala Val Ala Arg Ala Arg Gin Thr Thr Lys Glu Gly Tyr 
435 440 445 

Arg Lys 
40 4 50 

{?.) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS : 

26 
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(A) LENGTH: 18 base pairs 
(E) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D] TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: Genomic DMA 
<iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

10 (vi) ORIGINAL SOURCE : 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
TATATTCATT ATTTAACA 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 4 : 
TGGTGTGCGG AGCGATTA 

(2) INFORMATION FOR SEQ ID NO: 5: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
40 { D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(Lii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

27 
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20 
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(v! FRAGMENT TYPE: N-terminal 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



10 



15 



20 



25 



30 



35 



40 



Met ?he Met Arg Arg His Ala He He Leu Ala Ala Glv Lys dy - h r 

15 "> 15 

Arg Met Lys Ser Lys Lys Tyr Lys Val Leu His Glu Val Ala Glv Lys 

20 25 30 ' 

Pro Met val Glu Hxs Val Leu Glu Ser Val Lys Gly Ser Glv Va] *sp 

35 40 45 

Gin Val Val Thr He Val Gly His Gly Ala Glu Ser Val Lys Gly His 

50 55 60 

Leu Gly Glu Arg Ser Leu Tyr Ser Phe Gin Glu Glu Gin Leu Gly Thr 

65 70 75 B0 

Ala His Ala Val Gin Met Ala Lys Ser His Leu Glu Asp Lys Glu Gly 

85 ?0 Q5 

Thr Thr He Val Val Cys Gly Asp Thr Pro Leu He Thr Lys Glu Thr 

ioo 105 11Q 

Leu Glu Thr Leu He Ala His His Glu Asp Ala Asn Ala Gin Ala Thr 

115 120 125 

Val Leu Ser Ala Ser lie Gin Gin Pro Tyr Gly Tyr Gly Arg He Val 

130 135 140 

Arg Asn Ala Ser Gly Arg Leu Glu Arg He Val Glu Glu Lys Asp Ala 
145 150 155 l60 

Thr Gin Ala Glu Lys Asp He Asn Glu He Ser Ser Gly He Phe Ala 

165 1^0 175 

Phe Asn Asn Lys Thr Leu Phe Glu Lys Leu Thr Gin Val Lys Asr £sp 

180 185 190 

Asn Ala Gin Gly Glu Tyr Tyr Leu Pro Asp Val Leu Ser Leu lie Leu 

195 200 205 

Asn Asp Gly Gly He Val Glu Val Tyr Arg Thr Asn Asp Val G u Glu 

210 215 220 

He Met Gly Val Asn Asp Arg Val Met Leu Ser Gin Ala Glu Lys Ala 
225 230 235 240 

Met Gin Arg Arg Thr Asn His Tyr His Met Leu Asn Gly Val Thr He 

245 250 255 

He Asp Pro Asp Ser Thr Phe He Gly Pro Asp Val Thr lie Glv Ser 

260 265 270 

Asp Thr Val He Glu Pro Gly Val Arg He Asn Gly Arg Thr Glu lis 

275 280 285 

Gly Glu Asp Val Val He Gly Gin Tyr Ser Glu He Asr, *sn W — 
290 295 3Q0 
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He Glu Asn Gly 
305 

Val 31 y Ala Asn 

5 Ala Gin Leu Gly 
340 

:.ys Ala Asp Leu 
355 

Giy Asp Ala Val 
10 370 

Thr Val Asn Tyr 
: P. r 

A.-:p :er Fr»c Val 

15 ;;y Asp Aj;: Val 

^20 

I r o Asn Asp Ser 

Gly Tyr Arg Lys 
20 450 



Ala Cys He Gin Gin Ser 
310 

Thr Lys Val Gly Pro Phe 
325 330 
Ala Asp Val Lys Val Gly 
345 

Lys Asp Gly Ala Lys Val 
360 

lie Gly Glu Arg Thr Asn 
375 

Asp Gly Glu Asn Lys Phe 
390 

Gly Cys Asn Val Asn Leu 
405 410 
Leu Val Ala Ala Gly Ser 
425 

Leu Ala Val Ala Arg Ala 
440 



Val Val Asn Asp Ala Ser 
315 320 
Ala Gin Leu Arg Pro Gly 
335 

Asn Phe Val Glu He Lys 
350 

Ser His Leu Ser Tyr He 
365 

He Gly Cys Gly Thr He 
380 

Lys Thr lie Val Gly Lys 
395 400 
Val Ala Pro Val Thr He 
415 

Thr lie Thr Asp Asp Val 
4 30 

Arg Gin Thr Thr Lys Glu 
445 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 25 base pairs 

{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 (ii) MOLECULE TYPE : Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GCTCCTAAAA GGTTACTCCA CCGGC 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc I3bis) 



A. The indications made below relate to the microorganism referred to in (he description 
on page , line ] 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet ["""] 



Name of depositary institution 

National Collection of Industrial and Marine Bacteria Ltd. (NCIMB) 



Address of depositary institution (including postal code and country) 

Aberdeen 
Scotland 



Date of deposit 



11 September 1995 



Accession Number 

NCIMB 40771 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



In respect of those designations in which a European Patent is sought, a sample 
of the deposited micro-organism will be made available until the publication of 
the mention of grant of the European Patent or until the date on which the 
application has been refused or withdrawn, only by the issue of such a sample 
to an expert nominated by the person requesting the sample. 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if Oie indications are not for all desisted States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications iistedbelow will be submitted lo the International Bureau laier \specify the general nature of tite indications eg.. 'Accession 
Number of Deposit') 



For receiving Office use only 



| I This sheet was received with rhe international application 



Authorized officer 



For Intemaiiona) Bureau use only 



I I This sheet was received by the Intemaiiona! Bureau on: 



Authorized officer 



Form PCT/R 0/134 (July 1992) 
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What is claimed is: 



1 . An isolated polynucleotide comprising a member selected from the group 
5 consisting of: 

(a) a poiynucleotide having at least a 70% identity to a polynucleotide encoding 
a polypeptide comprising amino acids 1 to 450 of SEQ ID NO:2; 

(b) a polynucleotide having at least a 70% identity to a polynucleotide encoding 
a polypeptide comprising amino acids 1 to 452 of SEQ ID NO:5; 

]0 (c) a polynucleotide which is complementary to the polynucleotide of (a); 

(d) a polynucleotide which is complementary to the polynucleotide of (b); and 

(e) a polynucleotide comprising at least 15 sequential bases of the polynucleotide 
of(a),(b), (c)or(d). 

2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA. 
15 3. The polynucleotide of Claim I wherein the polynucleotide is RNA. 

4. The polynucleotide of Claim 2 comprising a polynucleotide sequence 
selected from the group consisting of the complementary sequence to nucleotide 54 to 1406 
set forth in SEQ ID NO: 1, and a complementary sequence to nucleotide 54 to 1412 set forth in 
SEQ IDNO:l. 

20 5. The polynucleotide of Claim 2 comprising the polynucleotide set forth in 

SEQ ID NO: 1 that encodes novel NAGPU polypeptide. 

6. The polynucleotide of Claim 2 which encodes a polypeptide comprising 

amino acid selected from the group consisting of amino acid 1 to 450 of SEQ ID NO:2 and 

amino acid 1 to 452 of SEQ IDNO:5. 
25 7. An isolated polynucleotide comprising a member selected from the group 

consisting of: 

(a) a polynucleotide having at least a 70% identity to a polynucleotide encoding 
the same mature polypeptide expressed by the DNA contained in NCIMB Deposit No. 40794 
and having the polynucleotide sequence of SEQ ID NO: I; 
30 (b) a polynucleotide complementary to the polynucleotide of (a); and 

(c) a polynucleotide comprising at least 1 5 bases of the polynucleotide of (a) or 

(b). 

8. A vector comprising the DNA of Claim 2. 

31 
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9. A host cell comprising the vector of Claim 8. 

1 0. A process for producing a polypeptide comprising: expressing from the host 
cell of Claim 9 a polypeptide encoded by said DNA. 

IK A process for producing a cell which expresses a polypeptide comprising 
5 transforming or transfecting the cell with the vector of Claim 8 such that the cell expresses the 
polypeptide encoded by the cDNA contained in the vector. 

1 2. A polypeptide comprising an amino acid sequence which is at least 70% 
identical to amino acid selected from the group consisting of 1 to 450 of SEQ ID NO:2 and 
ammo acid I to 452 of SEQ ID NO:5. 
10 1 3 - A polypeptide comprising an amino acid sequence selected from the group 

consisting of the amino acids as set forth in SEQ ID NO:2 and the amino acids as set forth in 
SEQ!DNO:5. 

14. An antibody against the polypeptide of claim 12. 

15. An antagonist which inhibits the activity of the polypeptide of claim 12. 

1 5 16. A method for the treatment of an individual having need of novel NAGPU 

protein comprising: administering to the individual a therapeutically effective amount of the 
polypeptide of claim 12. 

1 7. The method of Claim 16 wherein said therapeutically effective amount of the 
polypeptide is administered by providing to the individual DNA encoding said polypeptide 

20 and expressing said polypeptide in vivo. 

1 8. A method for the treatment of an individual having need to inhibit novel 
NAGPU polypeptide comprising: administering to the individual a therapeutically effective 
amount of the antagonist of Claim 15. 

19. A process for diagnosing a disease related to expression of the polypeptide of 
25 claim 12 comprising: 

determining a nucleic acid sequence encoding said polypeptide. 

20. A diagnostic process comprising: 

analyzing for the presence of the polypeptide of claim 12 in a sample derived from a 

host. 

^° 21 . A method for identifying compounds which bind to and inhibit an activity of 

the polypeptide of claim 12 comprising: 

contacting a cell expressing on the surface thereof a binding means for the 
polypeptide, said binding means being associated with a second component capable of 
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providing a detectable signal in response to the binding of a compound to said binding means, 
with a compound to be screened under conditions to permit binding to the binding; and 

determining whether the compound binds to and activates or inhibits the binding by 
detecting the presence or absence of a signal generated from the interaction of the compound 
5 with the binding means. 

22. A method for inducing an immunological response in a mammal which 
comprises inoculating the mammal with novel NAGPU protein, or a fragment or variant 
thereof, adequate to produce antibody to protect said animal from disease. 

23. A method of inducing immunological response in a mammal which 

1 0 comprises, through gene therapy, delivering a gene encoding a novel NAGPU protein 
fragment or a variant thereof, for expressing novel NAGPU protein, or a fragment or a 
variant thereof//? vivo in order to induce an immunological response to produce antibody to 
protect said animal from disease. 

24. An immunological composition comprising a DNA which codes for and 
1 5 expresses a novel NAGPU polynucleotide or protein coded therefrom which, when 

introduced into a mammal, induces an immunological response in the mammal to a given 
novel NAGPU polynucleotide or protein coded therefrom. 
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